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Abstract 



We give a singular control approach to the problem of minimizing an energy functional for 
measures with given total mass on a compact real interval, when energy is defined in terms of a 
completely monotone kernel. This problem occurs both in potential theory and when looking for 
optimal financial order execution strategies under transient price impact. In our setup, measures 
or order execution strategies are interpreted as singular controls, and the capacitary measure is 
the unique optimal control. The minimal energy, or equivalently the capacity of the underlying 
interval, is characterized by means of a nonstandard infinite-dimensional Riccati differential 
equation, which is analyzed in some detail. We then show that the capacitary measure has 
two Dirac components at the endpoints of the interval and a continuous Lebesgue density in 
between. This density can be obtained as the solution of a certain Volterra integral equation of 
the second kind. 

Keywords: Singular control, verification argument, capacity theory, infinite-dimensional Riccati 
differential equation, optimal order execution, optimal trade execution, transient price impact 

1 Introduction and statement of results 
1.1 Background 

Let G : R+ — > R+ be a function. The problem of minimizing the energy functional 



over probability measures /i supported by a given compact set K C R plays an important role 
in potential theory. A minimizing measure /i*, when it exists, is called a capacitary measure, 
and the value Cap (K) := l/£(/x*) is called the capacity of the set K; see, e.g., Choquet (1954), 
Puglede (1960), and Landkof (1972). See also Aikawa & Essen (1996) or Helms (2009) for more 
recent books on potential theory. 
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In this paper, we develop a control approach to determining the capacitary distribution //* 
when K is a compact interval and G is a completely monotone function. In this approach, 
measures /i on K will be regarded as singular controls and £{n) is the objective function. Our 
goal is to obtain qualitative structure theorems for the optimal control n* and characterize \f 
by means of certain differential and integral equations. 

The intuition for this control approach, and in fact our original motivation, come from the 
problem of optimal order execution in mathematical finance. In this problem, one considers 
an economic agent who wishes to liquidate a certain asset position of x shares within the time 
interval [0,T]. This asset position can either be a long position (x > 0) or a short position 
(x < 0) . The order execution strategy chosen by the investor is described by the asset position 
X% held at time t £ [0,T]. In particular, one must have Xq = x. Requiring the condition 
Xt+ = assures that the initial position has been unwound by time T. The left-continuous 
path X = (-Xt)te[o,Tl will be nonincreasing for a pure sell strategy and nondecreasing for a pure 
buy strategy. A general strategy can consist of both buy and sell trades and hence can be 
described as the sum of a nonincreasing and a nondecreasing strategy. That is, X is a path of 
finite variation. 

The problem the economic agent is facing is that his or her trades impact the price of 
the underlying asset. To model price impact, one starts by informally defining qdXt as the 
immediate price impact generated by the (possibly infinitesimal) trade dXt executed at time 
t. Next, it is an empirically well-established fact that price impact is transient and decays 
over time; see, e.g., Moro, Vicente, Moyano, Gerig, Farmer, Vaglica, Lillo & Mantegna (2009). 
This decay of price impact can be described informally by requiring that G(t — s) dX s is the 
remaining impact at time t of the impact generated by the trade dX s . Here, G : — > R+ is 
a nonincreasing function with G(0) = q, the decay kernel. Thus, J s<t G(t — s) dX s is the price 
impact of the strategy X, cumulated until time t. This price impact creates liquidation costs 
for the economic agent, and one can derive that, under the common martingale assumption for 
unaffected asset prices, these costs are given by 

C{X):=\f f G(\t-s\)dX s dX t (1) 

1 J[0,T] J[0,T] 

plus a stochastic error term with expectation independent of the specific strategy X; see 
Gatheral, Schied & Slynko (2012). Indeed, let us assume that asset prices are given by 
Sf^ = Sf + Jq G(t — s) dX s where S° is a continuous martingale and J Q * G(t — s) dX s mod- 
els the price impact of the trading strategy at time t. Then, we assume that the order —dXt 
is made at the average price \{Sf_ + S*) and costs \{S£_ + S*)dX t , which corresponds to 
a block shape limit order book, see Alfonsi, Fruth & Schied (2010). Accumulating these costs 
over [0,T], integrating by parts twice, and taking expectations yields 



E 



/ \(Sl+Sf)dX t 
J[o,T] 1 



-5 °A + E[C(X); 



where we have used the fact that E 



I\0T] ^tdXi = —SqXq, due to the martingale assumption 

on S . Further details can be found in Gatheral et al. (2012). 

Thus, minimizing the expected costs amounts to minimizing the functional C(X) over all 
left-continuous strategies X that are of bounded variation and satisfy Xq = x and X?+ = 0. 
This problem was formulated and solved in the special case of exponential decay, G(t) = e~ pt , 
by Obizhaeva & Wang (2005). The general case was analyzed by Alfonsi, Schied & Slynko 
(2012) in discrete time and by Gatheral et al. (2012) in the continuous-time setup we have used 
above. We refer to Alfonsi et al. (2010), Alfonsi & Schied (2010), Gatheral, Schied & Slynko 
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(2011) , Predoiu, Shaikhet & Shreve (2011), Schied & Slynko (2011), and Gatheral & Schied 

(2012) for further discussions and additional references in the context of mathematical finance. 
Clearly, the cost functional C(X) coincides with the energy functional £{y ) of the mea- 
sure u x (dt) := dXt. So finding an optimal order execution strategy is basically equivalent 
to determining a capacitary measure for [0, T\. There is one important difference, however: 
capacitary measures are determined as minimizers of £ (p) with respect to all nonnegative mea- 
sures p on [0, T] with total mass 1, while v x may be a signed measure with given total mass 
v ([0,T]) = —x. This difference can become significant if G{\ ■ |) is only required to be positive 
definite in the sense of Bochner (which is essentially equivalent to C{X) > for all X), because 
then minimizers of the unconstrained problem need not exist. It was first shown by Alfonsi et al. 
(2012), and later extended to continuous time by Gatheral et al. (2012), that a unique optimal 
order execution strategy X* exists and that X* is a monotone function of t when G is convex 
and nonincreasing. This result has the important consequence that the constrained problem 
of finding a capacitary measure is equivalent to the unconstrained problem of determining an 
optimal order execution strategy. 

In this paper, we aim at describing the structure of capacitary measures/optimal order 
execution strategies. To this end, it is instructive to first look at two specific examples in which 
the optimizer is known in explicit form. Obizhaeva &: Wang (2005) find that for exponential 
decay, G(t) = e~ pt , the capacitary measure p* has two singular components at t = and t = T 
and a constant Lebesgue density on (0,T): 

»*{dt) = ^ S (dt) + J^dt + J ^ S T (dt). (2) 

Numerical experiments show that it is a common pattern that capacitary measures for non- 
increasing convex kernels have two singular components at t = and T = and a Lebesgue 
density on (0,T). However, the capacitary measure for G{t) = max{0, 1 — pi} is the purely 
discrete measure 

N 



^ 2 + N^\ N + lJ y p T ~- P h 



2 + N^\ iV + L 

where iV := [pT\ (Gatheral et al. 2012, Proposition 2.14). 

So it is an interesting question for which nonincreasing, convex kernels G the capacitary 
measure p* has singular components only at t = and t = T and is (absolutely) continuous on 
(0, T). It turns out that a sufficient condition is the complete monotonicity of G, i.e., G belongs 
to C°°((0,oo)) and (-l) n G^ is nonnegative in (0, oo) for n € N. More precisely, we have the 
following result, which is in fact an immediate corollary of the main results in this paper. 

Corollary 1. Suppose that G is completely monotone with G"'(0+) := lim^o £*''(£) < oo. Then 
the capacitary measure p* has two Dirac components at t = and t = T and is has a continuous 
Lebesgue density on (0,T). 



1.2 Statement of main results 

Our main results do not only give the preceding qualitative statement on the form of p* but 
they also provide quantitative descriptions of the Dirac components of p* and of its Lebesgue 
density on (0, T). To prepare for the statement of these results, let us first assume that G(0) = 1, 
which we can do without loss of generality. Then we recall that by the celebrated Hausdorff- 
Bernstein-Widder theorem (Widder 1941, Theorem IV. 12a), G is completely monotone if and 
only if it is the Laplace transform of a Borel probability measure A on JR + : 

G{t) = [ e~ pt \(dp), t > 0. 
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In particular, every exponential polynomial, 



d 

G(t) = ^e-"', (3) 

8=0 

with Aj, pi > and Aj = 1 is completely monotone. Another example is power-law decay, 

G(t) = - — for some 7 > 0, 

W (1 +t)T ' 

which is a popular choice for the decay of price impact in the econophysics literature; see 
Gatheral (2010) and the references therein. We assume henceforth that G"(0+) < oo, which is 
equivalent to 

p := J p\(dp) < oo and p 2 := J p 2 \{dp) < oo. (4) 

A crucial role will be played by the following infinite-dimensional Riccati equation for func- 
tions ip : [0, oo) x R\ ->■ E, 



'{t,pi,p2) + {pi + p2)tp{t,pi,p 2 ) = —(pi + j x(p(t,px,x)X(dxj S j (j> 2 + j x<p(t,x,p2)\(dx) 

(5) 

where ip' denotes the time derivative of ip, and the function ip satisfies the initial condition 

ip(0, pi, p 2 ) = 1 for all pt, p 2 > 0. (6) 

Remark 1. When writing (5) in the form ip' = F(ip) one sees that the functional F is not 
a continuous map from some reasonable function space into itself, unless A is concentrated on 
a compact interval. For instance, it involves the typically unbounded linear operator ip i— > 
(pi + p2) ( P- Therefore, existence and uniqueness of solutions to (5), (6) does not follow by 
an immediate application of standard results such as the Cauchy-Lipschitz/Picard-Lindelof 
theorem in Banach spaces (Hille & Phillips 1957, Theorem 3.4.1) or more recent ones such as 
those in Teixeira (2005) and the references therein. In fact, even in the simplest case in which 
A reduces to a Dirac measure, the existence of global solution hinges on the initial condition; it 
is easy to see that solutions blow up when ip(0) is not chosen in a suitable manner. 

We now state a result on the global existence and uniqueness of (5), (6). It states that the 
solution takes values in the locally convex space C(M^_) endowed with topology of locally uniform 
convergence. For integers k > 0, the space C fc ([0, oo); C(R^_)) will consist of all continuous 
functions ip : [0, oo) — > C(M^_) which, when considered as functions ip : [0,oo) — > C(K) for some 
compact subset K of M.\, belong to C k ([0, oo); C{K)). 

Theorem 1. When G"(0+) < oo the initial value problem (5), (6) admits a unique solution ip 
in the class of functions ip in C 1 ([0, oo); C(M+)) that satisfy an inequality of the form 

0<£(t,Pl,P2)<c(l+pi)(l + p2), (7) 

where c is a constant that may depend on ip and locally uniformly on t. Moreover, ip has the 
following properties. 

(a) ip is strictly positive. 

(b) ip is symmetric: ip(t,pi,p 2 ) = ip(t,p2,pi) for all (pi,p 2 ) £ M+. 
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(c) 1 = f (p(t, p, x) X(dx) = f ip(t, x, p) X(dx) for all p > 0. 

(d) <^eC 2 ([0,oc);C(]R 2 )). 

(e) For every t, the kernel ip(t, •, •) is nonnegative definite on L 2 (A), i.e., 

f(x)f(y)<p(t, x, y) \{dx) X(dy) > for f € L 2 {\). (8) 

(f) The functions (p(t, p\,P2) a nd tp' (t, pi, P2) satisfy local Lipschitz conditions in (pi,p2), lo- 
cally uniformly in t. 



In Section 1.3 we will discuss computational aspects of the initial value problem (5), (6). In 
particular, we will discuss its solution when G is an exponential polynomial of the form (3) and 
we will provide closed- form solutions in the cases d = 1 and d = 2. 

We can now explain how to use singular control in approaching the minimization of £(/«) 
or C{X). To this end, using order execution strategies X = (Xt) will be more convenient than 
using the formalism of the associated measures p(dt) = dXt because of the natural dynamic 
interpretation of X. Henceforth, a [0, T]-admissible strategy will be a left-continuous function 
(Xt) of bounded variation such that Xt+ = 0. Our goal is to minimize the cost functional C(X) 
defined in (1) over all [0, T]-admissible strategies with fixed initial value Xq = x. Clearly, this 
problem is not yet suitable for the application of control techniques since C(X) depends on the 
entire path of X. We therefore introduce the auxiliary functions 

E?(p):=[ e-M-^ dX s , forp>0. (9) 
J[0,t) 

These functions will play the role of state variables that are controlled by the strategy X. 

Lemma 1. For any [0,T]-admissible strategy X, the function E^(p) is uniformly bounded in 
p and t. Moreover, 

C(X) = f f E*(p) X(dp) dX t + \ ^(AX,) 2 , (10) 

J[0,T) J t< rp 

where AXt '■= Xt + — Xt denotes the jump of X at t. 



Proof. Clearly, \Ef(p)\ < ||X|| var , where ||X|| var denotes the total variation of X over [0,T]. 
To obtain (10), we integrate by parts to get 

C(X) = [ I G(t-s) dX s dX t + ^P- V(AX t ) 2 . 

J[0,T) J[Q,t) 1 ^ 

Now we write G(t — s) as J e~ p ^~ s ^ \(dp) and apply Fubini's theorem. □. 

The form (10) of our cost functional is now suitable for the application of control techniques. 
To state our main result, we let <p be the solution of our infinite-dimensional Riccati equation 
as provided by Theorem 1 and we define 

(fo(t) := (p(t, 0, 0) and i()(t,p):= / xip(t,x, p) \(dx) (11) 
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Theorem 2. Let X* be the unique optimal strategy in the class of [0, T]- admissible strategies 
with initial value Xq = x. Then 



x 2 



C{X " )= ^m- (12) 

Moreover, X* has jumps at t = and t = T of size 

AX* =AX* T = - x 

and is continuously differentiable on (0, T). The derivative £(t) = 4fX^ is the unique continuous 
solution of the Volterra integral equation 

at) = f(t)+[ t K(t, S )Z(s)ds, (13) 
J o 



where, for 



@(t,p) : =B±^A I x z p(t,x,0)X(dx) - I x z p(t,x,p)X(dx)+p z , 



the function f and the kernel K(-,-) are given by 



ay* r i r 

fit) = ^ L J e ~ pt& ( T ~ P) X ( d P)> s) = 1 Tpj e " P( ^ S)0 ( T " *> P) WP)- ( 15 ) 



Let us recall that we know in addition from Theorem 2.20 in Gatheral et al. (2012) that 
t G [0, T] I—)- XI is monotone. The identity (12) immediately yields the following formula for 
the capacity of a compact interval. 

Corollary 2. If G"{Q+) < oo, the capacity of a compact interval [a,b] is given by 

Cap ([a, b]) = 2ipo(b — a). 



1.3 Computational aspects 



In general, the Riccati equation (5), (6) cannot be solved explicitly. A closed- form solution 
exists, however, when G is an exponential polynomial as in (3), i.e., when A has a discrete 
support. Let us assume that X(dx) = Yli=o ^pi(dx), with po = < p\ < • • • < pd, \ > 0, and 
Yli=o^i = 1- All input that is needed in Theorem 2 are the values Pij(t) := <p(t, pi, pj), for 
< i, j < d. By Theorem 1, ip(t) is a symmetric matrix that solves the following matrix Riccati 
equation: 



(16) 



(3) 



±X iPi X jPj , Mf = -^i + 5 ijPi , M« = -(MW) T and = Accord- 



With M { - — -zp^yi^jyj, my — r "IJ 

ing to Levin (1959), the solution of this equation is given by 



p(t) = (fiW(t)l + i?( 2 )(i))(i?( 3 )(i)l + J R( 4 )(t))- 1 , 



where lj,- = 1 and 



R(t) 



exp t 



M( 3 ) 



m( 4 ) 



In the special cases d = 1 and d = 2, the solution of the Riccati equation (5), (6) becomes even 
easier and, to some extend, becomes explicit. We demonstrate this first for d = 1 and then for 
d = 2: 
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Example 1. In the case d = 1, G is of the form G(t) = A + (1 — X)e~ pt for some A G [0, 1) 
and some p > 0. Clearly, we can set A := without changing the optimization problem. Then 
p = p\ = p, and (5) becomes 

Voo = |<Poi> Voi+PVoi = ~(l + <Pn)<Poi ; ¥>ii + 2p¥>n = |(l + (pn) 2 . 

For the initial condition <fki(0) = 1, the preceding equation has the unique solution ipn = <poi = 
1 and (poo(t) = 1 + pi/2. The condition (59) thus reduces to = X t + £f (t)(l + p(T - t)), 
which easily yields (2) as unique solution. 

Example 2. In the case d = 2, we can assume that G is of the form G(t) = \ie~ plt + A2e~ P2< , 
where Ai + A2 = 1. Consider a solution ipij (i, j = 0, . . . , 2) of the matrix Riccati equation (16) 
with Aq = 0. We can simplify (16) by using the relation 



Xlifil + X 2 (fi2 = 1, 

Indeed, the equation for <pn then becomes 



0,...,2. 



(17) 



1 2 1 2 

ip' n + 2p 1 ipu = — (pi + Xipiifn + \2P2f12) = 2=(Pi + P2 + M(pi ~ p2)<Pii) ■ 

This is an autonomous ODE that, for the initial condition <pn(0) = 1, is solved by 



<Pii(t) = ci + 
where 



1 



_ 1 A 2 (pi -p 2 ) 
1 - ci 4 VP1P2PP 



exp 



2VP1P2PP 



■t + 



! A 2 (pi-p 2 ) s 
4 V P1P2PP 



P ■= X1P2 + A 2 pi and ci 



'PiP 



We can notice that <pn(+oo) = ci and <pn(— 00) 
Similarly, 



Ai(pi -P2) / 
2 



Al(pi-P2) 



1 2 

(P22 + 2p 2 <P22 = — (pi + P2 + X 2 (P2 ~ Pl)<f22) , 

2p 



which for the initial condition <p22(0) = 1 is solved by 



(18) 



<P22(t) = C 2 + 

where 



1 lAl(pi-p 2 ) 



1 - c 2 4 \J p\P2PP 



C2 



exp 



>p 2 p 



%VPiP2PP 

p 



'pip y 



t +- 



1X 2 2 ( P1 - P2 f 



4 VP1P2PP 



X 2 (pi - P2) ' 

From (17) we can now easily compute <pi2- 

Next, using once again (17), we find that <poi solves 



(19) 



1 



Vol + Pi V01 = 7p(P2 + Ai(pi - P2)<P0l) (Pi + P2 + Ai(pi - P2)<Pll)- 
lp 



That is, 



V01 + 



P1P + P2P A? 2 
—2p—-2p0>i-P2) <Pn 



P2 

V01 = [Pi +P2 + Al(pi - P2)<Pll] • (20) 

2p 
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We set B 



i \i(pi~P2) 2 

4 VP1P2PP ' 



l-c, 



Blj and k = ^f^, so that <p n (t) = d + Aie i t+J?i ■ 
Then, we can check that A k t+Bie- kt * s a som tion of the fundamental system. By using a 
variation of parameters, we get that the solution of (20) satisfying (poi (0) = 1 is given by 



<Poi{t) 



Ai^oi(+oo)e fci + 5 1 ^ i(-oo)e- fei + Q 



kt 



01 



A x e kt + 5ie 



-kt 



with 



(±oo) = ±VPiP2PP Ptf and ^ = Ai(1 _ (+oo)) + _ m (_oo)). 

Then, <po2 can be easily deduced from (17). 
It remains to compute <poO) which solves 

^00 = ^= [P2 + Al(pi - P2)V0l] • 

We set Cqi = \%(pi — p 2 )Coi and get after some calculations: 



1 



01 



fct - B]ke~ kt 



P\P2 Coi Aifee 
2p 



p (^ ie fe < + B ie - kt ) 2 



+ 



C&p - 4AiSipip 2 p 



2pp 



(ylie fct + Sie- fci ) 2 ' 



Thus, we finally get: 
yoo(i) 



2p 



1 



Aie fcf + B x e- 



kt 



Ax+Bt 



(21) 



+ 



Cfcp - AA 1 B lPlP2 p 
4Bikpp 



1 



A x e kt + Sie 



A 1 + B 1 



This completes this example. 



Given the solution tp of the Riccati equation, we can approximate the continuous time 
strategy by a discrete one as follows (xj will denote the trading size at time iT/N). 

• We first set xq = 2 ^^'(r) and Eq{p^) = xq, < £ < d. 

• Suppose that 1 < i < N and that Xi-\ and Ei-i(pi) have been computed. Then, we set 
thanks to (59): 

i— 1 p 

Xi = l-J2*j- £i-i(p)e^ T/A ^(T - iT/N, p)X(dp), Et(pt) = Ei-i{pi)e^ T ' N + Xi . 
j=o J 

• Set x N = 1 - Yl)~=o x j- 

Alternatively, we could have approximated the minimization of the cost (1) by the following 
discrete problem. Let Mjj = G (\i — < i, j < N, and consider 

1 N 
minimize —x T Mx over x £ W N+1 s.t Xj = 1. (22) 

i=0 
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The solution of this problem is obviously given by lT] ^_ 11 M~ 1 l, where 1; = 1 for < i < N. 
From a financial point of view, the minimization problem (22) gives the optimal strategy when 
it is only possible to trade at the times iT/N, while the original problem (1) allows to trade 
continuously. In potential theory, it corresponds to computing the capacitary distribution of the 
set {iT/N | i = 0, . . . , N}. It was shown in the proof of Theorem 2.20 in Gatheral et al. (2012) 
that for N f oo these cap2acitary distributions converge in the weak topology of probability 
measures to the capacitary distribution dX* constructed in Theorem 2. Explicit solutions of 
(22) for the choices G(t) = e~ pt and G(t) = {l-pt) + were given in Alfonsi, Fruth & Schied (2008) 
and Alfonsi et al. (2012) (note, however, that G(t) = (1 — pt) + is not completely monotone). 

We have computed and plotted the solutions given by both methods in Figure 1 for T = 1, 
N = 50, and X(dp) = 0.15 (dp) + 0.2*1 (dp) + 0.25 3 (dp) + 0.25 5 (dp) + 0.25 7 (dp) + 0.15 10 (dp). 
They are already rather close together for N = 50, and they merge when ./V — > +oo. Let us 
discuss briefly the time complexity of the two methods. The one given by (22) gets very slow 
when N gets large since it involves the inversion of a N x N matrix. Instead, when A has a 
discrete support, the matrix Riccati equation can be solved quickly and the algorithm above 
has a O(N) time complexity, which is much faster. However, this is no longer true when A does 
not have discrete support. In that case, we have to approximate A by a discrete measure, which 
means that we have to increase d. Doing so, will slow down the algorithm based on the Riccati 
equation. A rigorous treatment of the convergence rate and time complexity of both algorithms 
is beyond the scope of this paper and is left for future research. 



2 Proofs 



2.1 Proof of Theorem 1 

Let us write (5) in the form (p'(t) = F\(ip(t)), where 

Fx(f)(pi,P2) = -(pi+P2)f(pi,P2) + ^z(pi+j xf{ Pl ,x)\{dxj)(p 2 +J xf{x,p 2 )\(dxj). (23) 

Lemma 2. Suppose that A is supported by the compact interval [0, /0 max ]. Then (5), (6) admits 
a unique solution ip £ C 1 ([0, oo); C(IR^)). Moreover, ip has the properties (a), (b), and (c) in 
the statement of Theorem 1. 



Proof. Let J C M+ be any compact interval containing [0,/9 max ]. Then F\ defined in (23) 
maps C(J x J) into itself. Moreover, F\ is Lipschitz continuous with respect to the sup-norm 
on every bounded subset of C(J x J). Hence, the Cauchy-Lipschitz/Picard-Lindelof theorem 
in Banach spaces implies the existence of a unique local solution ipj G C 1 ([0, t j); C{ J x J)) for 
some maximal time tj > (Hille &: Phillips 1957, Theorem 3.4.1). We will show below that 
tj = oo. Then, if J D J is another compact interval, the restriction of *pj(t) to J must coincide 
with <^j(t) due to the uniqueness of solutions. This consistency then implies the existence and 
uniqueness of solutions ip G C 1 ([0,oo);C(R 2 + ))). Moreover, the uniqueness of solutions and the 
fact that both (5) and (6) are symmetric in p\ and p 2 implies that <p(t, p±, p 2 ) = <p(t, p 2 , p\) for 
all (pi,p 2 ), which is property (b) in Theorem 1. 

We now fix an interval J D [0,p max ]. Before proving that tj = oo, we will show that 

J <pj(t, p, x) \(dx) = 1 for p e J and t < tj. (24) 

This then will establishes property (c) in the statement of Theorem 1 for t £ [0,tj). Then we 
will use (24) to derive some estimates on ipj that will yield ipj > and tj = oo. 
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Figure 1: Comparison of the approximated optimal strategies (xj,0 < i < N) obtained with (22) 
and with the method based on the Riccati equation (slightly shifted to the right) . 




Figure 2: Comparison of the approximated optimal strategies given by the Riccati method and 
N = 50, with the optimal continuous one X£ (computed with iV = 1000) . 
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To prove (24), we let I(t,p) := J <pj(t, p,x) \(dx) and i/jj(t,p) := f xcpj(t, p, x) X(dx). We 
have 

I'(t, p) + pl(t, p) + P) = {p + i>j(t, p)) (p + J xl(t, x) A(dx)) . (25) 
This is a (non-homogeneous) afhne ODE of the form I'(t) = b(t) + A(t)I(t), where the operator 

(A(t)f)(p) = -pf(p) + ±{ P + Mt,P)) J xf(x)X(dx) 

is a continuous map from [0,5] into the space of bounded linear operators on C(J) for each 
5 < tj. Hence this ODE admits a unique solution in C 1 ([0, 5]; C(J)) with initial condition 
1(0, p) = 1. But (25) is solved by I{t,p) = 1, which which establishes (24). 
For the next step, we let 



t := inf it £ [0,tj) I min ipj(t, pi, p 2 ) < 0\ . 

Pl,P2&J > 



Since tpj is a continuous map from [0,tj) into C(J x J) and v?j(0) = 1, we must have to > 0. 
Due to (24) we have on [0,io) that 

PlP2 - i t , \ . , \ !± \ ^ \Pl Pmax 

~2=r < <-Pj{t,Pi,P2) + (pi + p2)<pj{t,pi,p2) < — • (26) 

When defining 

<pj{t,p l ,p 3 ) :=e t ^+^i P j(t,p 1 ,p 2 ), (27) 
the preceding inequality can be rewritten as 

ElEl . e t( P l+P2) < 0> ( t ,Pl,p 2 ) < + PmaX ^ 2 + PmaX ^ . e t(Pl+P2)_ 

2p ' ' 2p 

Integrating these inequalities yields that for < t < to 

2p(pi + P2) 

with the convention 1 ~ e pi ( ^ P2> = t for p\ = p 2 = 0. Hence 

<pAt,px,P2) < + (Pi + P^M + P^ n _ e -*(Pi+P2)). (29) 

2p(pi + P2j 

Inequality (28) ensures that to > tj. Both inequalities (28) and (29) ensure the solution (fj(t) 
does not explode in finite time, which by standard arguments yields that tj = +oo. This proves 
the global existence of solutions as well as property (a) in the statement of Theorem 1. □ 

The preceding lemma works only for measures A that are concentrated on a finite interval. To 
obtain solutions for more general measures A, we need to find upper bounds that are independent 
of 

/°max- To this end, we first derive such bounds for the function ip(t,p) defined in (11). 
By Lemma 2, this function is well-defined whenever A has compact support, and it follows 
from dominated convergence together with (28) and (29) that ip £ C 1 ([0, oo); C(R+)) and that 
tp'(t,p) = J xip'(t, p, x) X(dx). 
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Lemma 3. Under the assumptions of Lemma 2, we have 

< tp(t, p) < y for all p > 0. (30) 

Proof. The lower bound in (30) is clear from ip > 0. To prove the upper bound, we suppose by 
way of contradiction that there exist t, p, and e > such that ip(t,p) > e + p 2 /~p. Then there 
must be a compact interval J D [0, /9 max ] such that 



t £ := inf U > 



/° 2 1 

max ib(t, p) > — + e > 
peJ p ) 



is finite. Since ip(0,p) = p and p 2 < p 2 , the time t £ must also be strictly positive. Moreover, 
there exists p e G J such that 

max ^(r e ,/9) = ip{r £ ,p £ ) = — + e. 
P aJ p 

Then r e is the first time at which the function t i— > ip(t, p £ ) reaches a new maximum, and so 
i//(r e ,p e ) > 0. 

Integrating (5) with respect to p\ \(dp\) and evaluating at p2 = p £ gives 

ip'(T £ ,p £ ) + p £ ip{r £ , p E ) + y p 2 tp(T £ ,p,p £ )\(dp) = ^z(p £ + ip(T £ ,p £ ))(p 2 + y ^(r e ,p)A(dp)).(31) 

Since J" ip(r £ , p, p £ ) X(dp) = 1, the Cauchy-Schwarz inequality (or, alternatively, Jensen's in- 
equality) implies that f p 2 ip(r £ , p, p £ ) X(dp) > ip(T £ ,p £ ) 2 . Moreover, the definition of p £ and the 
fact that A is supported on J yield that J pip(r e , p) X(dp) < ~pip(T £ ,p £ ). Plugging these two 
inequalities into (31) leads to 

1p'(T e , p £ ) < jp(/0e + ll>(Te,pe)) (p 2 + p^(r £ ,p £ )) - p £ ^{r £ , p £ ) - i()(t £ , p £ f 



Pep 2 



2p \2p 

where p(-) is a polynomial function of degree two. It has the two roots — p £ < and p 2 /~p > 0. 
Therefore p(x) < for x > p 2 /~p and in turn > p{ip{T £ , p £ )) = ip'(r £ , p £ ), which contradicts the 
fact that i)'(T E ,p e ) > 0. □ 



Lemma 4. Under the assumptions of Lemma 2, we have 

Wi + P2) 

(Pl + f)(P2 + f) 



(32) 



< exp(-( Pl + p 2 )t) + p p (1 - exp(-( Pl + p 2 )t)) , 

2p(pi + P2) 

-(P1+P2) ^ — < <P {t, pi, p2) < -— — • (33) 

Ip Ip 
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Proof. The ODE (5) can be rewritten as 

f'{t,pi,p 2 ) + (pi + p 2 )<p(t, p U p 2 ) = — (pi + V0,Pl))(P2 + tp(t,p 2 )). 

Defining <p as in (27) and using the upper bound in (30) thus yields that 

2 2 

™* ■ e <"+"> < 0>(t, Pl ,p 2 ) < iPl + ^ )( _ P2 + ^ ■ e^+^. 
2p 2p 



(34) 



(35) 



Arguing as in the final step of the proof Lemma 2 now yields (32). By plugging (32) back 
into (34) and using once again (30) , we obtain (33) . □ 



Lemma 5. For all R, T > there exist constants L\, L 2 > depending only on R, T, p, and 
p 2 such that for all t £ [0, T] and pi, pi p 2 G [0, R], 



and 



\<p(t,p 1 ,p 2 ) - ip(t,p!,p 2 )\ < Li\pi - pi\ 
\ip'(t,pi,p 2 ) - ip'(t,pi,p 2 )\ < L 2 \pi - pi\. 



(36) 
(37) 



Proof. We consider pi,pi,p 2 > and define 

Ap 1 :=p 1 -p 1 , Atp(t) = cp(t,pi,p 2 ) - cp(t,pi,p 2 ) and Aip(t) = ip(t, pi) - ip(t, p\). 

By subtracting the equation (34) satisfied by ip(t, pi, p 2 ) from the corresponding one satisfied 
by <p(t,Pi,p2), we get 

Aip'(t) + <p(t, pi , P2 )Api + (pa + Pi)Aip{t) = — (p 2 + i>{t, p 2 )) (a Pi + A^(*)) . (38) 

This equation is a linear non-homogeneous ODE for A<p(t) and, since A(p(0) = 0, solved by 

A<p(t) = 
' V l 



( y ^z(p2+^{s,p 2 )) ~ ip(s,pi,p 2 ]jA Pl + — (p 2 + ^(s,/3 2 ))A^( S ) 



-(Pl+P2)(t~s) ^ 



Since \tp(s, p 2 )\ < P 2 /p, we get with (32) and sup a>0 



l-e" 



t that 



f\ r |Api | d , (39) 



\A<p(t)\<-( P2 + ^) / (\Api\ + \AiP(s)\)ds+ll + 



2p v p j 
Now, we have that 



2p 



o 



|A^)| 



< 



P 



((p(t,pi,x) - ip(t,p!,x))x X(dx) < J \(p(t,pi,x) - ip(t,pi,x)\x\(dx) 
J (|Api| + |A^(s)|) ds+[p+ (pi + y^yTJ J |Api|ds. 



P Jo V v P 7 P 

For the last inequality, we have used Fubini's theorem and (39). Now, Gronwall's Lemma gives: 



|AV^)| < \Api\t(£ + p+ (pi + exp (£t). 



(40) 
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Plugging this back into (39), we get the existence of a constant L 1; which depends only on R, 
T, ~p, and p 2 , such that 

\A<p(t)\ < Li|A Pl |. (41) 

Finally, using (40) and (41) in (38) and recalling the locally uniform bounds (32) and (30) on 
<p and tp gives (37). □ 

Now drop the assumption that A is supported on a compact interval and aim at proving 
existence and uniqueness of solutions in this general case. To this end, we take a sequence 
Rn t oo for which A([0, Rx]) > and define 

fn ■= x ^ Q l R jy flp.fln] and d\ n = f n d\, (42) 

so that each A n satisfies the assumptions of Lemma 2. By ip n we denote the corresponding 
solution of (5), (6) provided by that lemma. For each n > 1, we have 

Ri _ r 72 _ 

2 \ s P _. „2 



pXn(dp)> I p\(dp)=:p and p\:= / p X n (dp) < =: pg. (43) 



o 



Hence, Lemma 4 yields that for each n, 



„2 -2 



., . p, po(l - e -*(pi+P2)) (pi + fli)(p2 + 1 _ e -(pi+Pa)* 

e-to+P*) + PiMl J < W.Ct.^pa) < 1 + ^ (44) 

2/W1+P2) 2p P1 + P2 

and 

- {Pl+P2) _ a s <(4 , P1 „ 2) £ (^faii) . (45) 

2^o 2p 
Similarly, Lemma 5 yields that for all R, T > there is a constant L > such that for all n 

\tp n (t, pi, P2) ~ <Pn{t,pi, p 2 )\ < L\pi - Pi\ for all t G [0,T], pi,pi,p 2 G [0,JR], (46) 

The inequalities (45), (46) and the Arzela-Ascoli theorem imply that the sequence (v? n ) n eN 
is relatively compact in the class of continuous functions on [0, T] x [0, R] 2 whenever T, R > 0, 
and hence admits a convergent subsequence in that class. By passing to a subsequence arising 
from a diagonalization argument if necessary, we may assume that there exists a continuous 
function ip : [0, 00) x M. 2 , — >■ R + such that ip n — > if locally uniformly. 

The uniform bounds (44) and dominated convergence imply that 



1pn(t,p) ■= I X<p n (t,p,x)X n (dx) = j X(p n (t,p,x)f n (x)X(dx) (47) 

x<p(t, p, x) X(dx) = t/j(t, p) (48) 



locally uniformly in (t,p). Hence, F Xn ((p n (t))(pi, p 2 ) -> F x (ip(t))(pi, p 2 ), locally uniformly in 
(t, pi, p 2 ), where F\ n is defined through (23). Since ip' n = F\ n ((p n ), we conclude that ip' n — > F\(ip) 
locally uniformly in [0,oo) x K^_. Moreover, we have for each n that 

<Pn{t,Pl,P2)-l= f' n (s,pi,p 2 )ds. 







The left-hand side of this equation converges to ip(t,pi, p 2 ) — 1, whereas the right-hand side con- 
verges to Jq F\(tp(s))(pi, p 2 ) ds. This proves that tp solves (5) and that p> G C 1 ([0, 00); C(M+)). 
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Remark 2. By sending R\ to infinity in (44) we get that the solution ip constructed above 
satisfies the bounds 



e -« M+M , + ens^^p. < ^.») < i + fa±|H^±f)l^^! . (49) 

2p(p 1 + p 2 ) 2p pi + p 2 

From (30), (47), and the lower bound in (49) we get moreover that 

0<ip(t,p)<^. (50) 



Now we turn to prove the uniqueness of solutions in the class of functions ip 6 C 1 ([0, oo); C(M^_)) 
satisfying a bound of the form (7) To this end, let (p\ and (f2 be two solutions in that class and 
set 

8(t,p 1 ,p 2 ) = <P2(.t,Pi,p 2 ) ~ lf 1 (t,p 1 ,p 2 ). 

We will show that ||<5(i)|| 2 ~ ~ = for all t, whenever A is a positive finite Borel measure of 

_ 11 v ;|I L 2 (A®A) ' 1 

the form A = A + p, where p is a positive finite Borel measure with compact support. Taking, 
for instance, p as the Lebesgue measure on [0, R] will then imply that 5(t,pi,p 2 ) = for 
p\ipi € [0, R\. So this will give the uniqueness of solutions. 
Let us define F\ as 

F x (f)(pi,P2) = ^(pi + f xf(p 1 ,x)\(dxj)(p2 + f xf(x,p 2 )\{dx)). (51) 



Lemma 6. We have F\(tpi(t)) £ L 2 (A (g> A) and 

\\F X (Mt)) - hfami^ * wwwifrx) + (52) 

where C is a positive constant that depends only on ~p and p 2 . 

Proof. For simplicity, we will drop the argument t throughout the proof. We may write 

= ^= J x6(pi,x) \(dx)(^p2 + J x(pi(x,p 2 ) X(dx)^j 

+ J x5(x, p 2 ) \{dx) (^pi + j xipi(pi,x) X(dx) S j + J x6(p\, x) X(dx) J x5(x, p 2 ) X(dx) 

Thus, 

(F A (^)-F A (^)) 2 



3 



4 2 

xS(p\,x) X(dx)[p 2 + / xtpi(x, p 2 ) \(dx) 



+ x6(x, p 2 ) \{dx) (^pi + J x<pi(pi,x) X(dx)^j\ + xS(pi,x) X(dx) J x5(x, p 2 ) \{dx) 



2 
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Now we integrate this inequality with respect to X(dpi) X(dp 2 ). The two first terms can be 
analyzed in the same way. First, we observe that J (p2 + =-) 2 \{dp2) is finite. Then we note that 

x5{p 1 ,x)X{dx)] / 5(pi,x) 2 X(dx) </? / 5( Pl ,x) 2 X(dx). (53) 



Hence, 

x6( Pl ,x)X(dx)\ X(d Pl ) <7\\6f LHm - 

Thus, the two first terms can be bounded by Coll^ll^^^)' where Cq is a constant that only 

depends on ~p and p 2 . Using once again (53), we get that the third term can be bounded from 
above by C\ II ^11^2 p;^)' where the constant C\ depends only on p and p 2 . □ 

Now we differentiate S 2 and integrate over [0,t]: 

/o 



5(t,p 1 ,p 2 ) 2 = -2 (pi + p 2 )5(s,p 1 ,p 2 ) 2 ds 
Jo 

+2 / 5(s,pi,p 2 ) F\((p 2 (s))(pi,p 2 ) - F x ((fi(s))(p 1 ,p 2 ) 
Jo L 

< 2 / 5(s,p!,p 2 ) F x (ip 2 (s))(p 1 ,p 2 ) - F x (<pi(s))(pi, p 2 ) 
Jo L 

We now integrate w.r.t. X(dpi) X(dp 2 ) and get by using the Cauchy-Schwarz inequality, 

J 

By continuity of t — > \\5(t, ■) [ I ^2 (A(g> A) ' we know that f° r eacn T > there is a constant K such 
that ||<5(t, •) 11^2 (A(g,A) ^ K when t £ [0,T]. Thus, we get from Lemma 6 that 



which in turn gives that ||<5(i, Oll^a/^X) = ^ 0n [^'^ Gronwall's Lemma. This concludes the 
proof of uniqueness. 

Now we turn to proving the properties (a) through (f) in Theorem 1. Property (a) (strict 
positivity) can be proved just as in the case of a compactly supported measure A in Lemma 
2. Property (b) (symmetry) is already clear. Property (c) (J ip(t, p, x) X(dx) = 1) follows from 
the corresponding property of the approximating functions <p n , the uniform bounds (44), and 
dominated convergence. 

Property (d) states that ip £ C 2 ([0, 00); C(IR+)). By dominated convergence and the bound 
(45), which also holds for ip' in place of (p' n , we get that 4>(t,p) belongs to C 1 ([0, 00); C(M+)). 
Thus, our ODE gives ip' £ C 1 ([0, 00); C(R+)), which proves property (d). 

We now prove property (e). It is clearly enough to prove it when / : R+ — > R is a bounded 
measurable function with compact support. To this end, let ip(t,pi,p 2 ) := e*( pi+P2 V(^ Pii 92)- 
Then ip belongs to C 1 ([0, 00); C(R\)) and 

0'(t, P i,p 2 ) = ^- e t ^ + ^(p 1 + ^(t, Pl ))(p 2 + p 2 )). 
2p 
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That is, (p'(t, pi, p 2 ) = g(t, pi)g(t, P2) for a function g. Thus, 

f(xi)f(x 2 )(f'(t,xi,x 2 )X(dx 1 )X(dx 2 ) = ( [ f(x)g(t,x)X(dxj) > 0. 



Since tp(t) = 1 + f <p'(s) ds, we find that ip(t) is nonnegative definite. Finally, with f(x) 
e~ tx f(x), 

f(xi)f(x 2 )<p(t,xi,x 2 )\(dxi)\(dx 2 )= / / f(xi)f(x 2 )if(t,xi,x 2 )X(dxi)\(dx 2 )>0. 



This establishes property (e) in Theorem 1. 

Finally, property (f) (the local Lipschitz property for ip and ip 1 ) follows just as in Lemma 5. 
This concludes the proof of Theorem 1. □ 

2.2 Proof of Theorem 2 

The strategy in the proof of Theorem 2 is to use a verification argument and based on guessing 
the optimal costs V(T, E(-),x) for liquidating x shares over [0, T] with additional and arbitrary 
initial data E(-). The result of our guess is formula (54) below. We explain its heuristic 
derivation in Appendix A. 

Let <p be a solution of the infinite-dimensional Riccati equation (5), (6). This solution gives 
rise to a family of linear operators $4 : L 2 (X) — > L 2 (X) n C(R+) defined by 



*t/(p) = / /(*M*. P) Kdx), f G L 2 {\). 

By (7), t 1 — y &tf is a continuous map into both L 2 (X) and C(R+) for each / G L 2 {\). By the 
inequality (45), which also hold for ip in place of <p n , t 1— > $t/ is a continuously differentiable 
map into both L 2 (X) and C(M+) for each / G L°°(A). 
For t > 0, E(-) G L 2 (A), and x G R, we define 

7(t, := i [^ttt - (<M?)(0)) 2 - $ t E)} , (54) 

where (■,■) denotes the usual inner product in L 2 (X). For t G [0, T] and a [0, T]-admissible 
strategy X we define 

Cf := / / Ef dXdX s + i T(AX S ) 2 + V(T - t, E? , X t ). 
J\o,t) J 2 TZ 



s<t 



By Lemma 1, the first two terms on the right correspond to the cost accumulated by the strategy 
up to time t. Moreover, 

V(0, E$,X T ) = ^(X T ) 2 -J E$ dX-X T = ^(AA T ) 2 + J E$ dX- AX T , 

due to the requirement Xf + = 0. This gives = C(X). Our goal is thus to show the following 
verification lemma: dC^ > with equality if and only if X = X* for a certain strategy X* . This 
will identify X* as the optimal strategy and V(T,E(-),x) as the optimal cost for liquidating x 
shares over [0, T] with additional initial data E(-) at time t = 0. In the formalism of potential 
theory, V(T, 0, —1) will then be the minimal energy of a probability measure on [0, T]. 
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Lemma 7. For every [0,T]-admissible strategy X, Cf is absolutely continuous in t and 



del 

dt 



^(T-t,0) 

MT-t) 



X t -{$T-tE?)(0)) + / E?(p)(p + ^(T-t,p))\(dp) 



(55) 



for a.e. t€ [0,T]. 



Proof. Recall the following integration by parts formula for left-continuous functions at, fit of 
locally bounded variation: 



atPt - a s /3 s 



/ a r d/3 r + / /3 r da r + Aa r A/3 r . 
■/[»,*) J[s,t) p6M 



It follows that 1 1 — y Ef (p) is of bounded variation and 

Ef (p) - Ef (p) = X t - X s - p f E? (p) dr (56) 

J s 



as well as 



= [ (E?( Pl ) + E?( P2 ))dX r - [ (p 1 +p 2 )E r x (p 1 )E r x (p 2 )dr + £ (AX r ) 2 . 

J\s,t) JS _r ,\ 



(57) 



r£[s,t) 



[«■*) 

Therefore, 

^p(T-t,p 1 ,p 2 )Ef(p 1 )Ef(p 2 ) - ip{T-s, Pl ,p 2 )E?{ Pl )E?{ P2 ) 

= - [ ^(T-r,p 1 ,p 2 )E r x (p 1 )E r x (p 2 )dr- [ ip(T - r, Pl ,p 2 )E? (p 1 )E r x (p 2 )( Pl +p 2 )dr 

J S J S 

+ [ V (T-r,p 1 ,p 2 )(E r x (p l ) + E r x (p 2 ))dX r + <p(T-r, Pl , P2 ) (AX r ) 2 . 

J ^ re[s,t) 

We have already observed in the proof of Lemma 1 that \Ef(p)\ is uniformly bounded in 
r 6 [0, T] and p > by the total variation of X. Hence we may integrate both sides of the 
preceding identity with respect to X(dp%) X(dp 2 ) to obtain, with the symmetry of (p and the 
notation Ef := pEf(p), that 

= - ! (Ef, & T _ r E?) dr-2 f {E?,$ T _ r EX) dr + 2 ! (1, Ef) dX r + ^ (AX r ) 2 . 
Js Js J[ s ,t) re[Sjt) 

By a similar reasoning we obtain 

$ T -t£f (0) - $ T -.E?(0) = - f & T - r E? (0) - / $ T _ r £ r x (0) + X t - X s 

J s J s 



and 



(X r -* T _ r ^*(0)) (*' T _ r ^(0)+$ T _ r i^(0)) dr. 
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Using these formulas, we can now compute 



a x - c 



X 



t 40 f ^(X r -^ r E?(O)) 2 dr 
<p$(T-r) 



+ 



ip (T-r) 



1 



+ ^ / (E?,<$>' T _ r E?)dr 



{E?,<5> T ^ r E?)dr 



Therefore, Cf is absolutely continuous on [0, T] and has the derivative 



dCi 



x 



dt 



Vo(r-t) 
2^(T-i) 



(X t -* T -t^(0)) 



(p (l -t) 'I 

for a.e. t. 

To further analyze the preceding formula, we take an extra point A. We let A := A + 5a and 
extend Ef and tp to functions on {A} U [0, oo) by putting 



Ef (A) := 



ip (T - 1 

t > 0, tp(t, A, p) = tp(t, p, A) := 0, p), 

fit, A, A) := p(*,0,0) = po(<)- 



X, - ($ r _ t £*)(0)) 



(58) 



We furthermore define the function 



x if x G [0, oo) 
if x = A, 



and we extend the definition of -E 1 ^ via Ef(x) = f(x)Ef(x) for x G {A} U [0, oo). Finally, we 
set for g G L 2 (X), <&tg{x) = f ip(t, x,y)g(y)\(dy) and one easily checks that : L 2 (X) — )• L 2 (A). 
With this notation, we get 



del 

dt 



i E f, ^ T -t E f + (Ef,^T-tEf) L 2Q^ + ($T-tE?,E?) L 2(%) 



1 

2 
1 

2 
1 

2 
1 
2 
1 

2 

where we have used the Riccati equation (5) and the notation (11) in the fourth step. This 
proves the assertion. □ 

It follows from the above that a [0, T]-admissible strategy X* with Xq = x satisfies 

C(X*) = V(T,0,x) < C{X) 



Ef(x)Ef(y)^'(T-t,x,y) + (f(x) + f(y))<p(T - t,x,y)j X(dx) \{dy) 
E?(x)E? (y) (f(x) + ^{T - t, x)) (f(y) + V(T - t, y)) X(dx) X(dy) 

Ef (x)(f{x) + ip(T-t,x)) X{dx) 
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for all other [0, T] -admissible strategies X with Xq = x if dC^* jdt vanishes for a.e. t £ [0, T]. 
Using (55), we write this latter condition as 

= XI + J Ef (p)9{T - t, p) \{dp) for a.e. t, (59) 

where 

at \ Mt){p + 4>{t, p)) 

0(r,p) = ^ V {t, P) 0). 

Then 

J m 
Plugging this and (56) into (59) yields that for a.e. t 

X: = X*Jl- + I* fpEf(p)9(T-t,p)X(dp)ds. (61) 

Thus, the left-continuous function X* coincides with an absolutely continuous function for 
a.e. t G [0, T]. It follows that these two functions coincide for every t 6 (0, T]. Thus, E x * is 
continuous on (0, T] by (56), which in turn implies via (61) that X* is continuously differentiable 
throughout (0, T). 

When taking the limit t \. in (61), we get 



0+ V (OnTffii' 



^ (T)2p, 
which gives 

y>(r,o) 

AXo = -2^(T) s - (62) 

That AJ * = AXf, follows from Remark 2.10 in Gatheral et al. (2012). 

Since ip(t,p) is continuously differentiable in t, 0(t,p) is also continuously differentiable in t. 
Differentiating (59) with respect to t € (0, T) yields 

= jX* + f ^f^0(T - t, p) \{dp) - f Ef {p)9'{T - t, p) \{dp) 

= 2 iIt {t _~v] • j t x > - I Ef {p) {0,{T - *- p) + p9[T - *' p)) x{dp) ' 

where we have used (56) and (60) in the second step. This gives 

j f X^ = ^-^^ J Ef(x)(9'(T-t,x)+x9(T-t,x))X(dx). (63) 

We now want to simplify (63). To this end, we use the notation tfi(t) := f xip(t,x) X(dx). 
and the formulas 

<p'(t,p,0) = -p<p(t,p,0) + ^(p + i;(t,p))^(t,0), V ' (t) = —^(t,0) 2 
fj/(t,p) = - J x 2 <p(t,x,p)\(dx)-p^(t,p) + 1 ^(p + ij(t,p))(p2+^(t)) (64) 
0) = - j x 2 ip(t,x,0)X(dx) + ^(t,0)(7 + ^(t)), 
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Then a tedious computation shows that 



8\t,p)+pe(t,p) = -^yQ(t,p), 

where @{t,p) is as in (14). Therefore, (63) becomes 

jX* t =^ J Ef(x)@(T-t,x)X(dx). (65) 

Now we have E** = AXg + J Q * e~ p ^ _s ^X t * ds. Plugging this formula back into (65) and using 
Fubini's theorem yields that = 4tX% solves the Volterra integral equation (13). This is 
a Volterra integral equation of the second kind with continuous kernel K(t,s) and continuous 
function f(t). It hence admits a unique continuous solution x(-) (Linz 1985, Theorem 3.1). 
Conversely, given such a solution x(-), we can define a [0, T] -admissible strategy X* via (62) 
and x(t) = f t X* t . Then X* satisfies (61) for t = 0+ as well as (63) for t > 0. Integrating (63) 
and reversing the steps made above in deriving (63) from (61) shows that X* satisfies (61) for 
t G (0, T], and so X* is optimal. This concludes the proof of Theorem 2. □ 



A Heuristic derivation of the value function 

We want to explain here how it is possible to guess the value function V(t, E(-), x) introduced 
in (54). We start our discussion by deriving a formula for the costs C(X) of a strategy (X s , s G 
[0,T]) that is arbitrary on [0,t) and optimal on [t,T]. We set 



C t (X) := C{X)-\! [ G([s-r\)dX r dX s 
1 J[o,t) J[o,t) 

= nf I G([s-r\)dX r dX s + [ [ G(\s-r\)dX s dX r 

2 J\t,T] Jh,T] J\0,t) J\t,T] 



'[t,T]J[t,T] J[0,t)J[t,T] 

The rightmost integral can be written as 



/ / G(\s - r\) dX r dX s = f A{s)dX s 

J[t,T] J[0,i) J[t,T] 



where 



A(s) = J Ef(p)e- p(s ~ t) X{dp), t<s<T. 
The first-order condition of optimality thus reads 

/ G(\s - r\) dX r + A(s) = v iort<s<T, (66) 

J[t,T] 

where v is a suitable Lagrange multiplier (compare Theorem 2.11 in Gatheral et al. (2012)). 
Lemma 8. Suppose that R and R are functions with finite variation such that Rt = Rt and 

[ G(\s-r\)dR s = [ G(\s-r\)dR s for all r G [t,T]. 

J[t,T] J[t,T] 

Then R s = R s for all s G [t, T] . 
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Proof. We have Jj t T , G(\s — r\) d(R s — R s ) = and hence that 



I I G(\s-r\)d(R s -R s )d(R r -R r ) = 0, 

J\t,T] J\t,T] 



<[t,T] J[t,T] 

which implies the assertion in view of the fact that G is strictly positive definite. 



□ 



Now suppose that we have auxiliary functions with finite variation Bt(p) such that Bxip) = 



and 



We also define 



/ G{\s - r\) dB r {p) = e- p{s - t] for t < s < T. 
J[t,T] 



Z* :-- 



E?(p)B s (p)\(dp), 



so that 



Therefore, 



A(s) = [ G(\s - r\) dZ r , for t < s < T. 
J[t,T\ 



(67) 



/ G(\s - r\) d(X r + Z r ) = v = v [ G(\s - r\) dB r (0) for t < s < T. 

J[t,T] J[t,T] 

Lemma 8 hence implies that X s + Z s = vB s (0) for t < s < T. Hence, we get 
v = 4±j£, x K = 44^ B,(0) - Z s for s 6 [t, T}. 



Bt(0) 

From these identities we get 
1 



Bt(0) 



Ct(X) 



[t,T] J[t,T] 



G([s-r\)dX r dX s + / A{s)dX, 



[t,T] 



\u 2 I I G(\s-r\)dB r (p)dB 8 (p)-v [ I G(\s - r\) dB r (0) dZ s 

1 J[t,T\ J[t,T] J[t,T] J[t,T\ 

+1 I I G{\s-r\)dZ r dZ s + f f G(\s-r\)dZ r dX s 

1 J[t,T] J[t,T] J[t,T] J[t,T] 

(X ' + Z ' )2 - / / G(\s-r\)dZ r dZ s 

J\t,T] J\t,T] 



~B t (0) J[t,T]J[t,T] 

since the first double integral is equal to —Bt(0). Now we define 

V (T-t, Pl ,p 2 ) := I e-Mr-t) dB r (p 2 ), pi,p 2 >0. 

J[t,T] 

Then, we have 



<p(T - t, pt, p 2 ) = / G{\r - s\)dB s (p 1 )dB r {p 2 ) = <p(T - t, p 2 , pi), 

J[t,T] J[t,T] 

I ' <p(T-t,p 1 ,p 2 )X(dp 1 ) = [ G(r - t)dB r { P2 ) = 1. 

J J\t,T] 
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Moreover, we observe that <p(T — t,0,p) = —B t {p) since -Br(p) = 0. This gives in particular 
that Z t = - J E?(p)(p(T - t, 0, p)\{dp). Besides, we have 



/ / G(\s -r\)dZ r dZ s = [ A(s) 

J[t,T] J[t,T] J\t,T] 



[t,T] 

[t,T] 



E?( Pl )E?(p 2 ) / e-" 1 '-*) dB s ( P2 ) X(d Pl ) X(d P2 ) 

Jim 



E?( Pl )E? (p 2 )<p(T - t, Pl , P2 ) A(dpi) \{d P2 ] 



?x 



Thus, we obtain altogether that 



Ct(X) 



1 



X t -fEf(pMT-t,0,p)\(dp) 
<p(T-t,0,0) 



E?{ Pl )E? ( P2 MT-t, PUP2 ) X{d Px ) \{dp 2 



This suggests the definition 

! (x-fE(p)<p(T-t,0,p)\(dp)) 



\ 2 



V(t,E(.),X) :-- 



so that we have 



<p(t,0,0) 



E{p 1 )E(p 2 )ip(t, pi,p 2 ) X(dpi) X{dp 2 



C(X)= [ [ Ef(p)X(dp)dX s + lj2( AX s) 2 + V(T-t,E t (.),X t ). 
J[o,t) J 2 ^ 

Thus, we have obtained the formula given by (54), and it remains to explain why 92 should solve 
a Riccati equation. To do so, we consider an arbitrary strategy (X s ,s £ [0,T]) and consider 
the cost (68), which is the cost of the strategy that is equal to X on [0, t) and optimal on 
[t,T]. To make the dependence on t explicit, we denote this cost by Ct{X). To simplify things, 
we will focus on the particular case (3) of a discrete measure X(dx) = Y2i=o Xid Pi (dx), with 
Po = < pi < ■ ■ ■ < pd, Xi > 0, and Yli=o = 1- With this choice, V only depends on E(pi), 
< i < d. We introduce the following notations: 

tp(t,pi,pj),0 < i,j < d, 



V(t,E ,...,E d ,X) 



d d 



—^777 (x-Y] \i£k<poi(tj) 2 -VV XiXjEiEjtpijit) 



i=0 



i=0 j=o 



x 



E 



Ct(X) 



Y^XiEf{ Pi ), 

i=0 

[ E X S dX s + \ ^2(AX S ) 2 + V(T - t, Ef(p ), Ef(p d ),X t ). 
J\o,t) ^ 



'[o,t) - s<t 

Lemma 9. We have AC t (X) = for all t G [0,T]. 

Proof. Note that AX t = AE?(p). Hence it is clear that AC t (X) = if AX t = 0. Now suppose 
that AX t / 0. Then 



d 1 
AC t (X) = XiE?( Pi )AX t + -(AI f ) 2 + ACt(X). 



(69) 



i=0 
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On the other hand, we have 

V(t,E + S,...,E d + 6,X + S) 

d _ d d 



(X + S-J2 \i(Ei + <5)^(t)) 2 - EE XiX ^ Ei + *)(^ + ^M*) 

^ 00 ^ ' i=0 i=0 j=0 

d d i-odd 




V(t, E ,...,E d ,X)-5^2J2 WjEi<Pij(t) - — ^ E A ' A /r,,{/i 

j=0 i=0 j=0 i=0 

5 2 



V(t,E ,...,E d ,X)-Sj2^ R ' 



8=0 

Here we have used the facts that Yli Aj = 1 and 2~Z« A^j = 1- Putting everything together 
yields the assertion. □ 

We can now focus on infinitesimal variations, and we denote Vt := dV/dt, Vi := dV/dEi and 
V x := dV/dX. We have, when AX t = 0, 



dC t (X) = E? dX t -V t dt + J2 V i dE ?(Pi) + V xdX t 

i=0 

d d 



dt 



i=0 i=0 



By simple calculations, we get E t + ^» + = ^' an< ^ our expression simplifies to 



dC t (X) = -(Vt + J2 Pi E ?(pi)^) dt 



i=0 

Let us now calculate Vt- 

i d 2 d d I i d 

v t = ~^r( x ~ E x j e jvv) ~ E x ^ ( x " E S 21 " 9 E ^J E i E Mr 

^00 V i=0 i=0 i=0 ^ 00 Z y=0 



To simplify computations, we define 



A_ i: =l and E _ i:= 5z?k*WML (70) 

^oo 



as well as <p—u := (foi and (f-i-i = ^oo- Then 

1 d 

v t = ~2 E ^EiXjEjtp'ij. 

With p_i := po = 0, we get ^ = -Aj ^A_i£'_i^_ij + ^ =0 XjEjcpij) = -A, Ej=-i XjEjipij and 
therefore 

d d 

^^PiEiVi = — 2 ^i^iXjEjipij. 

i=0 i,j=—l 
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Altogether, we obtain 

1 d 

where E^(p^i) is defined according to (70). 

Thus, we arrive at the following quadratic form \ Ylt 1=0 EkEi\kXi(tp' kl + (pk + pi)<Pkl)- Since 
we should have dCt{X) > with dCt(X) = for the optimal strategy, this quadratic form should 
be nonnegative with rank one. Indeed, since the control X is of dimension one, it would not 
be possible to make dCt(X) = if the rank of the quadratic form were higher than two. We 
are now going to write the conditions that ensures that this quadratic form is nonnegative with 
rank one. To do so, We introduce the new coordinates (Ao, . . . , A^) such that 

E = A , E 1 = A + A 1 , ■■■ E d = A + A d . 

In these coordinates, our quadratic form becomes 



-A^Aj^'oo + 2/woo) + A o( A o + Aj)AoAi(<Poi + (po + < 

i=i 

^ d d 

+9 J2( A ° + Afc )( A ° + A i)A* Ww + (pk + pi)<p 



o / j / j \ — « ' — r»/\ — " ' — U"«-i\rH ' \rn. ■ ri/rklj 
k=l 1=1 

After some calculations, we get that the coefficient for Ag, AoA;, A^A; and A^ (for 1 < k, I < d) 

are respectively p, \i(pi + Y1=q ^kPk<Pkl), hk(<f' k i + (pk + Pl)<Pkl) and ItfWu + 2 Pi ( Pu)- 
Thus, the matrix Q for the quadratic form has coefficients 

Qoo = P, 

A d 
Qoi = Qio = y (pi + Y ^kPWki) , 

k=0 

Qkl = Qik = -^khiv'hl + (Pk + Pl)<Pkl) if k, I > 2, k 7^ /, 

Since Q is of rank one, the determinant of the matrices 

Qoo Qoi \ / Qoo Qoz 

Qio Qu / V Qfco 

must vanish for 2 = 1, . . . , d and k < I. That gives, respectively 

1 d 
f'll + 2 /W/ = ^(pi + ^2 X kPWkl 

k=0 
d 



f'kl + (Pk + Pl)Vkl = ^^Pl + ^kpmi)(pk + ^2^jPj<Pkj), (71) 



i=0 j=0 



which gives precisely the Riccati equation. Thus, equation (71) holds for 1 < k, I < d. In fact, 
the choice of Eq = Ao is arbitrary. Had we chosen Ei = A, for some i > 0, we had obtained (71) 
for k,l 7^ i. Therefore (71) holds in fact for all k, I = 0, . . . , d. 
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